Inter-Method Performance Study of Tumor Volumetry Assessment on Computed Tomography Test-Retest Data.

نویسندگان

  • Andrew J Buckler
  • Jovanna Danagoulian
  • Kjell Johnson
  • Adele Peskin
  • Marios A Gavrielides
  • Nicholas Petrick
  • Nancy A Obuchowski
  • Hubert Beaumont
  • Lubomir Hadjiiski
  • Rudresh Jarecha
  • Jan-Martin Kuhnigk
  • Ninad Mantri
  • Michael McNitt-Gray
  • Jan H Moltz
  • Gergely Nyiri
  • Sam Peterson
  • Pierre Tervé
  • Christian Tietjen
  • Etienne von Lavante
  • Xiaonan Ma
  • Samantha St Pierre
  • Maria Athelogou
چکیده

RATIONALE AND OBJECTIVES Tumor volume change has potential as a biomarker for diagnosis, therapy planning, and treatment response. Precision was evaluated and compared among semiautomated lung tumor volume measurement algorithms from clinical thoracic computed tomography data sets. The results inform approaches and testing requirements for establishing conformance with the Quantitative Imaging Biomarker Alliance (QIBA) Computed Tomography Volumetry Profile. MATERIALS AND METHODS Industry and academic groups participated in a challenge study. Intra-algorithm repeatability and inter-algorithm reproducibility were estimated. Relative magnitudes of various sources of variability were estimated using a linear mixed effects model. Segmentation boundaries were compared to provide a basis on which to optimize algorithm performance for developers. RESULTS Intra-algorithm repeatability ranged from 13% (best performing) to 100% (least performing), with most algorithms demonstrating improved repeatability as the tumor size increased. Inter-algorithm reproducibility was determined in three partitions and was found to be 58% for the four best performing groups, 70% for the set of groups meeting repeatability requirements, and 84% when all groups but the least performer were included. The best performing partition performed markedly better on tumors with equivalent diameters greater than 40 mm. Larger tumors benefitted by human editing but smaller tumors did not. One-fifth to one-half of the total variability came from sources independent of the algorithms. Segmentation boundaries differed substantially, not ony in overall volume but also in detail. CONCLUSIONS Nine of the 12 participating algorithms pass precision requirements similar to what is indicated in the QIBA Profile, with the caveat that the present study was not designed to explicitly evaluate algorithm profile conformance. Change in tumor volume can be measured with confidence to within ±14% using any of these nine algorithms on tumor sizes greater than 10 mm. No partition of the algorithms was able to meet the QIBA requirements for interchangeability down to 10 mm, although the partition comprising best performing algorithms did meet this requirement for a tumor size of greater than approximately 40 mm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Test-Retest and Inter-Rater Reliability Study of the Schedule for Oral-Motor Assessment in Persian Children

Objectives: Reliable and valid clinical tools to screen, diagnose, and describe eating functions and dysphagia in children are highly warranted. Today most specialists are aware of the role of assessment scales in the treatment of affected individuals. However, the problem is that the clinical tools used might be nonstandard, and worldwide, there is no integrated assessment performed to assess ...

متن کامل

Assessing the Validity and Reliability of the Persian Version of the Interpersonal Problem Solving Skills Assessment Tool in Schizophrenia

Objective: This study aimed to translate the Assessment of Interpersonal Problem-Solving Skills (AIPSS) into Persian and to evaluate the validity and reliability of the Persian version of AIPSS to use for adults with schizophrenia. Materials & Methods: In this methodological study, the translation process was performed according to the International Quality of Life Assessment (IQOLA) protocol....

متن کامل

Assessment of X-Ray Crosstalk in a Computed Tomography Scanner with Small Detector Elements Using Monte Carlo Method

Introduction: Crosstalk is a leakage of X-ray or light produced in a matrix of X-ray detectors or array of photodiodes in one element to other elements affecting on image contrast and spatial resolution. In this study, we assessed X-ray crosstalk in a computed tomography (CT) scanner with small detector elements to estimate the effect of various parameters such as X-ray tube voltage, detector e...

متن کامل

Test–retest reliability of mandibular morphology measurements on cone-beam computed tomography-synthesized cephalograms with random head positioning errors

BACKGROUND Cephalometric radiography has been used for orthodontic and surgical treatment planning and assessment, and for quantifying mandibular growth. However, it remains unclear how head positioning errors and the level of examiner experience affect the reliability of such morphometric measurements. The current study aimed to bridge the gap by determining the intra-, inter-rater, and inter-...

متن کامل

Dose Assessment in Computed Tomography Examination and Establishment of Local Diagnostic Reference Levels in Mazandaran, Iran

Background: Medical X-rays are the largest man-made source of public exposure to ionizing radiation. While the benefits of Computed Tomography (CT) are well known in accurate diagnosis, those benefits are not risk-free. CT is a device with higher patient dose in comparison with other conventional radiation procedures. Objective: This study is aimed at evaluating radiation dose to patients from ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Academic radiology

دوره 22 11  شماره 

صفحات  -

تاریخ انتشار 2015